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Abstract 



C/3 

q. 

r*j ■ We construct a piecewise onto 3-to-l dynamical system on the pos- 

itive quadrant of the unit circle, such that for rational points (which 
correspond to normalized Primitive Pythagorean Triples), the associ- 
ated ternary expansion is finite, and is equal to the address of the PPT 
on Barning's [D] ternary tree of PPTs, while irrational points have in- 
£> ' finite expansions. The dynamical system is conjugate to a modified 

CN . Euclidean algorithm. The invariant measure is identified, and the sys- 

tem is shown to be conservative and ergodic. We also show, based 

^O ' on a result of Aaronson and Denker [2J, that the dynamical system 

can be obtained as a factor map of a cross-section of the geodesic flow 

f— s . on a quotient space of the hyperbolic plane by the group T(2), a free 

subgroup of the modular group with two generators. 



1 Introduction 



The starting point of this paper is a theorem, attributed to Barning jH] , on 
the structure of the set of Primitive Pythagorean Triples, or PPTs. Recall 
that a PPT is a triple (a, b, c) of integers, with a,b,c> 0, gcd(a, b) = 1 and 

a 2 + b 2 = c 2 . (1) 

Clearly, if (a, b, c) is a PPT, then one of a, b must be odd, and the other even. 
Barning j^j, and later independently several others [7[ ITT] [F2\ IT^l Ho] ITTj I2U] 
(see also [El), showed: 
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Theorem 1. Define the matrices 



Mi 




Mo 




M, 




(2) 



Any PPT (a, b, c) with a odd and b even has a unique representation as the 
matrix product 

b =M dl M d2 ...M dn I 4 , (3) 

for some n > 0, (di, <i 2 , . . . , d n ) G {1,2, 3} n . Any PPT (a, b, c) with a even 
and b odd has a unique representation as 




M dl M d2 ...M dr | 3 



(4) 



for some n > 0, (d\, d%, 
flUD, © is a PPT. 



d n ) € {1, 2, 3} n . Any triple of one of the forms 



In some of the papers where this was discussed, the theorem has been 
described as placing the PPTs (a, b, c) with a odd and b even on the nodes of 
an infinite rooted ternary tree, with the root representing the "basic" triple 
(3,4,5), and where each triple (a,b,c) has three children, representing the 
multiplication of the triple (considered as a column vector) by the three ma- 
trices Mi, M2, M3. In this paper, we consider a slightly different outlook. We 
think of the sequence (d±, g?2, • • • , d n ) in (jSJ), (JU) as an expansion correspond- 
ing to the triple (a, b, c), over the ternary alphabet {1, 2, 3}. We call the d^s 
the digits of the expansion. To distinguish between PPTs with a odd, b even 
and those with a even, b odd, we affix to the expansion a final digit d n +i, 
which can take the values oe (a odd, b even) or eo (a even, b odd). So we 
have a 1-1 correspondence 

00 
(a, b, c)e PPT ^ (d 1 ,d 2 ,...,d n+1 )e{J{l,2,3} n x{oe,eo}. 

n=0 



Several questions now come to mind: 



• Is there a simple way to compute the expansion of a PPT? (Yes - this 
is contained in the proof of Theorem 1.) 

• As is easy to see and has been known since ancient times, the mapping 
(a,b,c) — > (a/c,b/c) gives a 1-1 correspondence between the set of 
PPTs and the rational points (x, y) on the positive quadrant Q of 
the unit circle. Are the digits in the expansion piecewise continuous 
functions on the quadrant? (Yes.) Can one define a ternary expansion 
for irrational points'! (Yes.) 

• Can interesting things be said about the expansion (d 1; <i 2 , • • • , d n+ i) of 
a random PPT, chosen from some natural model for random PPTs, say 
by choosing uniformly at random from all PPTs (a, b, c) with c < N 
and letting N — ► oo? (Yes.) 

• Do these questions lead to interesting mathematics? (Yes - they lead 
to a dynamical system on Q with interesting properties.) 

It is the goal of this paper to answer these questions. The basic obser- 
vation is that it is in many ways preferable to deal with points (x, y) on 
the positive quadrant Q of the unit circle, instead of with PPTs. In the 
proof of Theorem 1, we shall see that there is a simple transformation (a 
piecewise linear mapping) that takes a PPT (a, b, c) to the PPT (a', b', d) 
that corresponds to its parent on the ternary tree, i.e., if (a, b, c) has expan- 
sion (di, . . . , d n +i) then (a', b', c') has expansion (d 2 , • • • , d n +i). A standard 
trick in dynamical systems is to rescale such transformations, discarding 
information that is irrelevant for the continuing application of the transfor- 
mation; this is done, for example, when rescaling the Euclidean algorithm 
mapping (x, y) — > (y, x mod y) to obtain the continued fraction transfor- 
mation x — > {l/x}. When we apply this idea to our case, we obtain the 
following result. 

Theorem 2. Let 

Q = {(x, y) : x > 0, y > 0, x 2 + y 2 = 1}. 
Define the transformation T : Q — > Q by 

\2-x-2y\ \2-2x-y\ 



T(x,y) 



3 - 2x - 2y ' 3 - 2x - 2y 
3 



Define d : Q — » {1, 2, 3, oe, eo} fry 



d(x,y) 



1 4/3 < x/y, 

2 3/4 < x/y < 4/3, 

3 x/y < 3/4, 



oe 
eo 



(*>*) = (14) 
(*.v) = (if) 



T/ien: 



(%) // (x, y) = (a/c, b/c) G <2 H Q 2 is a rational point of Q, with a/c, b/c 
in lowest terms (so (a, b, c) is a PPT), then for some n > 0, T n+1 (x, y) 
(the (n + l)-th iterate ofT) will be equal to (1,0) or (0, 1), and if we 
define 

d k = d(T k -\x,y)), k = l,2,...,n + l, 

then (dx,d2, ■ ■ ■ ,d n+ x) is the ternary expansion (with the last digit in 
{oe,eo}) corresponding to the PPT (a,b,c) as in ©, 0- 

(it) If (x,y) G Q is an irrational point, then T n (x,y) G Q for all n > 0, 
and the sequence 

d k = d{T k -\x,y)), k>\, 

defines an infinite expansion for (x,y) over the alphabet {1,2,3}, with 
the property that it does not terminate with an infinite succession of 1 's 
or with an infinite succession of 3 's. 

(Hi) Any sequence (dk)k>i over the alphabet {1,2,3} which does not ter- 
minate with an infinite succession of 1 's or an infinite succession of 
3 's determines a unique (irrational) point (x, y) G Q such that dk = 
d(T k -\x,y)), k>l. 

Examples. Here are some examples of points in Q and their expansions. 
If (dk)i<k<n is an expansion (finite or infinite), we denote by [di, c^, • • •] the 
point (x, y) G Q which has the given sequence as its expansion. 

$$3 : h IBS : B 



(35/37,12/37) = [1,1, oe] (65/97,72/97) = [2,1, oe] (33/65,56/65) = [3,l,oe] 

(77/85,36/85) = [l,2,oe] (119/169,120/169) = [2,2, oe] (39/89,80/89) = [3,2,oe] 
(45/53,28/53) = [l,3,oe] (55/73,48/73) = [2,3, oe] (7/25,24/25) = [3,3,oe] 

(V2/2,V2/2) = [2,2,2,2,...] (2/^5,1/^5) = [1,2,1,2,...] 
(1/2,^/2) = [3,1,3,1,...] (l/y/E,2/y/5) = [3,2,3,2,...] 
(V3/2,l/2) = [1,3,1,3,...] (3/VT0,l/Vl0) = [1,1,2,1,1,2,...] 

(cos 1, sin 1) = [3, 1, 1, 3, 1, 1, 1, 1, 3, 1, 1, 1, 1, 1, 1, 3, 1, 1, 1, 1, 1, 1, 1, 1, 3, . 

(see section 5) 

(cos(1/tt), sin(l/7r)) = [1, 1, 2, 1, 2, 2, 3, 3, 3, 3, 3, 3, 3, 1, 3, 3, 3, 3, 3, 3, 2, . . .] 

(this is meant as an example of a "typical" expansion - see section 4) 

'4(n + l) 2 -l 4(n + l) 



[1, 1, ... , 1, oe] (n times "1' 
[2, 2, ... , 2, oe] (n times "2' 



4(n + l) 2 + l'4(n + l) 2 + 1 
On a n + (-1) 

> 



where (a n )n>o = (3,21,119,697,...) and (c n ) n >o = (5,29,169,985,...) are 
sequences A046727 and A001653, respectively, in The On-Line Encyclopedia 
of Integer Sequences 



After constructing the dynamical system associated with the ternary ex- 
pansion, the next step is to study its properties. What does the expansion 
of a typical point look like? To a trained eye, the answer is contained in the 
following theorem. 

Theorem 3. Let ds denote arc length on the unit circle. The dynamical 
system (Q,T) possesses an infinite invariant measure \x, given by 

d/j,(x,y) 



y/(l-x)(l-y) 

With the measure ft, the system (Q, T, fi) is a conservative and ergodic infinite 
measure-preserving system. 

The invariant measure \x encodes all the information about the statistical 
regularity of expansions of "typical" points. In section 4 we shall state more 
explicitly some of the number-theoretic consequences of Theorem 3. 



Recall (|13j. Theorem 225) that the general parametric solution of the 
equation (|TJ with a, b > coprime, a odd and b even is given by 

a = m 2 — n 2 , b = 2mn, c = m 2 + n 2 , (5) 

where m,n have opposite parity, m > n > 0, and gcd(m, n) = 1. A roughly 
equivalent statement is that the map 

fl-t 2 It 
D:t 



1 + t 2 ' 1 + fr 

maps the extended real line injectively onto the unit circle, and maps the 
rational numbers, together with the point at infinity, onto the rational points 
of the circle. Note also that the interval (0, 1) is mapped onto the positive 
quadrant Q. It is thus natural, in trying to understand the behavior of the 
dynamical system (Q,T), to conjugate it by the mapping D, to obtain a new 
dynamical system on (0,1). This leads to the following result (the precise 
meaning of the last statement in the theorem will be explained later): 

Theorem 4. (Q, T, //) is conjugate, by the mapping D, to the measure 
preserving system ((0, 1),T, v), where 



t 



I — 2 1 3 ' 

f{t) = (D- l oToD)(t) = { |-2 !<*<!, 

2-1 l<t<l, 

, . . 1 dt 

du(t) 



y/2 t(l-ty 

The mapping T is the scaling of a modified slow (subtractive) Euclidean al- 
gorithm. 

There has been some interest in obtaining natural dynamical systems ap- 
pearing in number theory as factors of certain cross sections of the geodesic 
flow on quotients of the hyperbolic plane by a discrete subgroup of its isom- 
etry group. This has been done by Adler and Flatto jU E] and by Series |22] 
for the continued fraction transformation, and by Adler and Flatto |3] for 
Renyi's backward continued fraction map (see also [3123)- in both of these 
cases, the underlying surface was the modular surface, which is the quotient 
of the hyperbolic plane by the modular group T = PSL(2, Z). Representing 
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0.2 0.4 0.6 0.8 

Figure 1: The interval map T 



a system as a factor of a cross-section of a geodesic flow enables one to derive 
mechanically (without guessing) an expression for the invariant density, and 
to deduce various properties of the system. Series (|2.3], Problem 5.25(i)) 
posed the general problem of replicating this idea for other number- and 
group-theoretical dynamical systems. 

It turns out that the map T also admits such a representation. Aaronson 
and Denker |2] studied a certain cross-section of the geodesic flow on a differ- 
ent quotient of the hyperbolic plane, which is the quotient by the congruence 
subgroup T(2) of the modular group, a free group with two generators. They 
obtained the map T as a factor of that cross section, and used this to de- 
rive results on the asymptotic behavior of the Poincare series of the group 
r(C\Z) of deck transformations of C\Z. Since in their paper the motivation 
was completely different from ours, and the connection to the map T was 
not known, we find it worthwhile to include here a version of their result. 

Theorem 5. Let (H, (<ft)tem.) be the upper half-plane model of the hyperbolic 
plane, with the associated geodesic flow <pt '■ T\(M) — > Ti(HI), where Ti(H) = 
HI x S 1 is the unit tangent bundle o/H. Define the group of isometries ofM 



and let M = H/r(2) be the quotient space o/H by T(2), which has a funda- 
mental domain 



zeM:\Rez\<l, 



2 



1 

> - 

2 



Let Ti : HI — ► M be the quotient map. Let ((p t ) te j% be the geodesic flow on M. 
LetX'dT^ 



X' = | (z,u) G OF x S 1 : z + eu G F for small e > i, 

and let X C Ti(M) ; X = d7r(X') be the natural section of M corresponding 
to the fundamental domain F of all inward-pointing vectors on the boundary 
of F. Let r : X — » X be the section- or first-return map of the geodesic flow, 
namely 

where 

t u = inf{£ > : Tp t (u) G X}. 

Then the section map {X,t) admits the map (Q,T) as a factor. That is, 
there exists an (explicit) function E : X — » Q such that T o E = E o r. 

Theorem 5 is an immediate consequence of Aaronson and Denker's result 
(|2j, section 4) and Theorem 4. To describe explicitly the factor map E, 
define p\{x, 5, e) = x. Then, in the notation of their paper, 



E = D O p x O 7] 7T + , 

with the "juicy" parts being our map D and and the map 7r + , which assigns to 
a tangent vector the hitting point on the real axis of the geodesic emanating 
from the lifting of the tangent vector to dF x S 1 . For more details consult 

m 

The congruence subgroup T(2) also appears in the paper by Alperin [2j, 
which discusses the ternary tree of PPTs. 

In the next section, we reprove Theorem 1, and show how the linear map- 
pings involved in the construction of the ternary tree of PPTs can be scaled 
down to produce the transformation T. This will result in a proof of Theorem 
2. In section 3, we prove Theorem 4 and discuss the connection to modified 
Euclidean algorithms. The ergodic properties of the system will be derived, 
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using standard techniques of infinite ergodic theory, proving Theorem 3. In 
section 4 we discuss applications to the statistics of expansions of random 
points on Q and random PPTs. Section 5 has some remarks on points with 
special expansions. 

2 Construction of the dynamical system 

2.1 The piecewise linear transformation 

First, we recall the ideas involved in the proof of Theorem 1. We follow the 
elegant exposition of [TH] . 

We shall consider solutions of (JTJ) with gcd(a, b) = 1, and c > 0. Define 

PPT = {{a,b,c) e I? : gcd(a,b) = 1, a, b, c> 0, a 2 + b 2 = c 2 }, 
the set of PPTs, and 
SPPT = {(a,b,c) GZ 3 :gcd(a,6) = 1, c> 0, a 2 + b 2 = c 2 }, 
the set of signed PPTs. 

The basic observation is that the equation (JTJ has three symmetries. Two of 
them are the obvious symmetries a — > —a, b — > —b (we ignore the symmetry 
c — » — c, since we are only considering solutions with c > 0). The third 
symmetry is not so obvious, but becomes obvious when the correct change 
of variables is applied. Define new variables m, n, q by 

m = c — a a = q + m 

n = c — b < ► b = q + n 

q = a+b—c c = q+m+n 

In the new variables, (0) becomes 

q 2 = 2mn. (6) 

There is therefore a third natural involution on the set SPPT of solutions 
of (HJ), given in m,n,q coordinates by q — > —q. So, starting from a solution 
(a, b, c) G SPPT with associated variables (m, n, q) and setting q' = —q, vn! = 
m, n' = n, we arrive at a new solution (a', b', d) given by 



a' -- 


= q' + m! 


= a — 2q = 


= 2c- a -2b 


b' -- 


= q' + n' 


= b-2q = 


= 2c -2a -b 


d -- 


= q' + m! + n' 


= c-2q = 


= 3c -2a -2b 




or in matrix notation 

:)-(3 

The matrix I is an involution, i.e. I 2 = id 3 . It is easy to see that a', V are 
also coprime, and 

c' = 3c - 2(o + 0) > 3c - 2^ • Va 2 + o 2 = (3 - 2v / 2)c > 0. 

So i" maps SPPT to itself. / has the fixed points (1,0, 1) and (0, 1, 1). We 
claim that /(PPT) = SPPT\(PPTU{(1, 0, 1), (0, 1, 1)}). Indeed, this simply 
means that if (a, 6, c) G PPT, then at least one of a', V is negative, or in other 
words that 2c < max(a + 26,2a + b). Assume for concreteness that a > b, 
then 2c < 2a + b if 2 < 2x + y, where (x, y) = (a/c, b/c) G Q fl {x > y}, or 
equivalently if 

±<{{x,y), (2/V5,l/>/5)). (7) 

The point (2/\/5, l/v5) lies on the arc Q fl {a; > 2/}, and one checks easily 
that there is an equality at one end (1, 0) of the arc, and a strict inequality 
at the other end (1/V2, 1/V2). So © holds. 

Having shown that if (a, 6, c) G PPT, then (a', 6', c') is a signed PPT with 
one of a, 6 negative, we can forget about the signs of a', b' and obtain a new 
triple (a", 6", c") = (|a'|, |6'|, c'). c' is strictly less than c, since c' = c — 2q and 
g = a + 6-c = a + 6- V^TF > on PPT. The new triple will be a PPT, 
except when (a, 6, c) = (3, 4, 5) or (4, 3, 5), in which case (a", b" , c") will equal 
(1,0,1) or (0,1,1), respectively. If (a",b",c") is a PPT, there are precisely 
three PPTs (a, 6, c) leading to it via this procedure - corresponding to the 
three possible sign patterns a' < < b'; a', b' < 0; a' > > b' (we ruled 
out a 1 , b' > 0) - and they can easily be recovered, as follows: If a' < < 6', 
then 
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(with Mi as in (J2J)). Similarly we get 

a" \ ( a\ ( a" 

.1/, 1 b" , if a', b' < 0, or 6 = M 3 6" | if,/ > > //, 

We are now ready to prove Theorem 1. First, from the above discussion 
it follows that M 1 ,M 2 ,M 3 take PPTs to PPTs with a strictly larger third 
coordinate. In particular, any triple of one of the forms (J3J), (J3J) is a PPT. 
Next, for (a, 6, c) G PPT define 

S(a,b,c) = (|2 — a — 26[, |2 — 2a — 6|, 3 — 2a — 26), 

{1 2c - a - 26 < < 2c - 2a - 6 

2 2c - a - 26, 2c - 2a - b < 

3 2c - a - 26 > > 2c - 2a - 6 

Sk(a, b, c) = 5(5 (a, 6, c)), A; = 1,2, ... ,n(a, 6, c), 
n(a, 6, c) = max{n > : S n (a, 6, c) G PPT}. 

The above discussion can be summarized by the equations 

M S{a , b ,c)(S(a,b,c))\ sImA 6 ))=d (d= 1,2,3). (8) 

Let (a, 6, c) G PPT with a odd and 6 even. It is easy to see that M\, M 2 , M 3 
preserve the parity of a, 6, so (a, 6, c) cannot have a representation (jlj. We 
claim that it satisfies Q with dk = Sk{a, 6, c), k — 1, . . . , n(a, 6, c), and that 
this representation is unique. The proof is by induction on c. The claim holds 
for the basic triple (3, 4, 5), because M 1; M 2 , M 3 increase the third coordinate. 
Assume that it holds for all odd-even PPTs with third coordinate < c. Then 
in particular this is true for (a', 6', c') = S(a, 6, c), since we know that d < c. 
So we may write 




a' \ 


i 


i 


^3 


« 


= M ei M e2 . 




4 


d 




1 


I 5 



with Ck = Sk(a', b',d), 1 < k < m = n(a', b', d) = n(a, 6, c) — 1. We have 
e k = S(S k ~ l (a', 6', d)) = 5(S k (a, 6, c)) = 4+i(a, &, c) = 4+i, 
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where we denote d k = 8k(a,b,c). Therefore, by (jSJ), 

/a'\ 

= M 5(aM \ V \ = M dl M d2 . . . M dm+1 



which is our claimed representation. Uniqueness follows immediately by not- 
ing that by (jSJ), di is determined by the sign pattern of (2c— a— 26, 2c— 2a— b), 
and continuing by induction. Theorem 1 is proved. 

2.2 Scaling the transformation 

It is now easy to rescale the transformation S to obtain a transformation 
T from Q to its closure. If (x, y) = (a/c, b/c) G Q fl Q 2 is a rational point 
of <2, which corresponds to the PPT (a, 6, c), then the triple S(a, 6, c) = 
( 1 2c — a — 2b\, 1 2c — 2a — b\, 3c — 2a — 26) corresponds to the point 

2c-a-26| |2c-2a-6|\ ( \2 - x - 2y\ \2c - 2a - b\ 



3c -2a -26' 3c -2a -26/ \3-2x-2y' 3c - 2a - 26 

in Q. This precisely accounts for our definition of T in Theorem 2. To 
explain why the function d is the correct rescaling of 5, observe, for example, 
that S(a, 6, c) = 1 iff 



2-x-2y<0<2-2x-y 



j.<((x,y), (1/V5,2/V5)> 

<(x, 2 /),(2/v / 5,l/v / 5)><^, 



which some inspection reveals to hold exactly on the (open) circular arc 
connecting the point (1,0) with the point (4/5,3/5). This corresponds to 
the condition x/y > 4/3 in the definition of d. Similarly, it can be checked 
that S(a,b,c) = 2 if (x,y) lies on the circular arc between (4/5,3/5) and 
(3/5, 4/5), and S(a, 6, c) = 3 if (x, y) lies on the circular arc between (3/5, 4/5) 
and (0, 1). These arcs form the generating partition of the ternary expansion 
- see Figure 2. 

Figure 3 shows the graph of the map obtained by parametrizing the quad- 
rant Q in terms of the angle (multiplied by 2/n, to obtain a map on the 
interval (0, 1)). As Theorem 4 may imply, this is not the best parametriza- 
tion, but it gives a good graphical illustration of the behavior of the mapping 
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d= 1 



Figure 2: The quadrant Q and the generating partition 




0.2 0.4 0.6 0.8 1 

Figure 3: The conjugate map F^^-oToF, where F(t) = (cos(irt/2), sin(7r£/2)) 



T. Note that, contrary to appearance, the map is not linear on the middle 
interval! 

We have proved part (i) of Theorem 2. For part (ii), observe that all 
the iterates of an irrational point (x, y) G Q under T remain irrational, 
since T is defined by piecewise rational functions with integer coefficients 
which are invertible (so as before, given d(x,y), we can recover (x,y) from 
T(x,y) by a rational function with integer coefficients, hence if T(x,y) is 
rational, so is (x,y)). For T n (x,y) to be in Q \ Q = {(1,0), (0, 1)}, we must 
have T n_1 (a;,|/) = (3/5,4/5) or (4/5,3/5), and this cannot happen for an 
irrational point. Therefore T n (x,y) is defined for all n > 1, as claimed. 
The resulting sequence of digits dk = d(T k ~ 1 (x, y)) cannot terminate with an 
infinite succession of l's. Indeed, on the first interval (0, - arctan(3/4)) of the 
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generating partition of the mapping G = F^ 1 o T o F (Figure 3) it is easy to 
verify that G' is increasing and satisfies G'(0) = 1. Therefore if t is some point 
on this interval, then the sequence of iterates G k (t) satisfies G k {t) > G'{t) k • 
t and therefore must eventually leave this interval (so the corresponding 
succession of l's in the expansion will terminate). A symmetrical argument 
applies for the third interval of the generating partition, implying that no 
infinite expansion terminates with an infinite succession of 3's, and part (ii) 
is proved. 

Turn to the final part (iii) of Theorem 2. Again we use the mapping G 
in Figure 3. Let I\ = (0, | arctan(3/4)), J 2 = (| arctan(3/4), | arctan(4/3)), 
Is = (- arctan(4/3), 1) be the intervals of the generating partition of G. 
Given an infinite expansion (dk)k>i that does not terminate with an infinite 
succession of l's or an infinite succession of 3's, we must show that there is 
a unique number t G (0, 1) such that G fc_1 (t) G Id k for all k > 1. 

Consider, for any n > 1, the cylinder set 

A n = {te (0, 1) : G k -\t) G I dk for 1 < k < n}. 

(A n ) n >i is a decreasing sequence of non-empty open intervals. By compact- 
ness, the intersection of their closures contains at least one point t. The 
condition on the sequence (dk)k>i implies that t is in fact in the intersection 
of the open intervals; otherwise, t is an endpoint, say of A n , but that would 
imply that d^ — 1 for all k > n or d^ = 3 for all k > n. 

We have shown existence of a number with a prescribed expansion. But 
uniqueness also follows, since, as Figure 3 shows, any appearance of a "2" 
digit, or a non-1 digit following a succession of l's, or a non-3 digit following 
a succession of 3's, entails a shrinkage of the corresponding set A n by at least 
a constant factor bounded away from 1. So the diameter of the A n 's goes to 
0, and their intersection contains at most one point. Theorem 2 is proved. 

(Here is another argument demonstrating uniqueness: any two irrational 
points on Q are separated by a rational point; after a finite number of appli- 
cations of T, the rational point will be mapped to (3/5, 4/5) or to (4/5, 3/5), 
and the images of the two irrational points will be contained in different 
elements of the generating partition, implying a different first digit in their 
expansions.) 



14 



3 The modified Euclidean algorithm 

3.1 Some computations 

The inverse function of D is easily computed to be 

1 — x 



D-\x,y) 

y 

Using this, a routine computation, which we omit, shows that indeed 

f = (D~ l oToD). 

We show that the measure v is T-invariant. If dvit) = f(t)dt, the invariant 
density must satisfy 

The inverse branches of T are given by 

*(*)= 1^(0,1/3), 

^(t) = ^G (1/3,1/2), 

^(*) = ^e (1/2,1), (10) 



for which 



iT'iF.m = (l-2F 1 (t))- 2 = (l + 2t) 2 , 
\T'(F 2 (t))\ = F 2 (t)- 2 = (2 + t) 2 , 
\T'(F 3 (t))\ = F 3 (tr 2 = (2-i) 2 . 

So Q reduces to 

/(t) = ; VTTtJ (l + 2t) 2+/ V2T7J (2+^ + / V2^J (2^t)2" (U) 
Check directly that fit) = 1/(£(1 - *)) satisfies (JUJ). 
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To complete the proof of Theorem 4, we need to verify that [i is the 
push- forward of the measure v under D. Denote 



x = x w = ^-^i y = y\P) 



Compute: 



1 + t 2 ' " * w 1 + t 2 



2t 2 (l-t) 2 V 1/2 1 + * 



y/(l-x)(l-y) \l+t 2 1 + t 2 J y/2t(l-t)' 

ds = ^dx 2 + dy 2 = ^x'{t) 2 + y'{t) 2 dt 

2dt 

= 1 + t 2 ' 




V2t(i-t) ^(i- x )(i- y y 

as claimed. 

Note that this also proves that \x is T- invariant. This fact could be checked 
directly, of course. 

3.2 Interpretation as a Euclidean algorithm 

The ordinary Euclidean algorithm takes a pair of positive integers (x, y) with 
x > y and returns the pair (y, x mod y). After a finite number of iterations 
of this mapping, y will be equal to and x will be equal to the g.c.d. of the 
original pair. 

Many variants of this algorithm have been analyzed, where various al- 
ternatives to simple division with remainder are used. The study of such 
algorithms, related of course to continued fraction variants, is a huge subject 
which it is beyond the scope of this paper to describe. See [2T]; sections 
4.5.2-4.5.3 in ^H] and the references there; and [S] for some more recent 
developments. 

The standard Euclidean algorithm has a more ancient version, known 
as the slow, or subtractive Euclidean algorithm, where subtraction is used 
instead of division, so (x,y) are mapped to (max(x — y,y), min(x — y,y)). 
Clearly the standard algorithm is nothing more than a speeding-up of this 
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algorithm. One may scale by always replacing a: by 1 and y by the ratio y/x. 
This leads to the interval map R : (0, 1) — > (0, 1), 



R(t) 



T~ t 0<t < 1/2, 
*=*■ 1/2 < t < 1 




0.2 0.4 0.6 0. 



We now observe that the map T is itself the scaling of a modified algorithm, 
defined by the mapping 

(x-2y,y) iix-2y>y, 
(x,y), x>y — ► (y,x-2y) iiy>x-2y>0, 

(y, 2y — x) if x — 2y < 0. 

Here is a sample execution sequence of this algorithm: 

(155, 100) — ► (100, 45) — ► (45, 10) — ► (25, 10) — ► (10, 5) — > (5, 0). 

This algorithm can be used to compute g.c.d.'s, just like its famous kin: The 
last output which differs from the one preceding it is of either the form (a, a) 
or (a, 0), where a is the g.c.d. of the two original integers. 

3.3 Ergodic properties of T 

To prove Theorem 3, we study the somewhat simpler measure preserving 
system ((0, 1),T, v). Since this is now represented as an interval map, we 
can use standard techniques of ergodic theory. 

Define d(t) = d(D(t)). Define the set J = (1/5,2/3). An alternative 
description for J is as the cylinder set 

J = It e (0,1): d(t) = 2 or (d{t) = 1 and d(f(t)) ^ l) 

or (d(t) = 3 and d(f(t)) + 3) 
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By Theorem 2(ii), for any irrational t G (0, 1), T n (t) G J for some n > 0. In 
other words, 

oo 

(0 } l)=\Jf- n (J) a.e. 

n=0 

This implies that T is conservative, by [I], Theorem 1.1.7. 

To prove that T is ergodic, we pass to the induced system (J,Tj,/a\j), 
where 

fj{t) = f vAt \t), 

(pj(t) = inf{n > 1 : T n (t) G J}. 

By the explicit description of Tj given in |2], p. 16, it follows ([2], Lemma 5.2) 
that Tj : J — ► J is a topologically mixing Markov map which is uniformly 
expanding with bounded distortion, i.e., satisfies 

IT" Ml 
inf \T'j(t)\ > 1, sup 1 Z J{ ,l < oo. 

tzj{T>{t)Y 

Therefore ([I], Theorem 4.4.7) it is exact, and in particular it is ergodic. It 
follows (PP, Proposition 1.5.2(2)) that T is itself ergodic. This completes the 
proof of Theorem 3. 

4 Expansions of random Q-points and ran- 
dom PPTs 

4.1 Random points on Q 

The invariant measure /x becomes infinite near the two ends of the quadrant 
Q. This means that in a typical expansion, the digits "1" and "3" will occur 
infinitely more often than the middle digit "2" . However, for any two digit 
sequences, even ones that contain the digits "1" and "3", we can ask about 
their relative density of occurence in the expansion of typical points. 

Theorem 6. Let h = (0, 1/3), I 2 = (1/3, 1/2), I 3 = (1/2, 1) be the intervals 
of the generating partition ofT. For (di, . . . , d n ) G U^ :1 {1, 2, 3} e , define 

A(d 1 ,...,d n ) = v((\f-i +1 (I d .)\ = V ((F dl oF^o...oF dn )({0,l))\ 
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with F 1 , F 2 , F 3 as in (HUD- Let (d u ...,d n ), (e 1 , ...,e m )e U£ 1 {1, 2, 3} £ . For 
almost every (x, y) e Q, i/je /zmit 

#jo < k< N : d{T k+ ^~ 1 (x,y)) = d s , l<j<n\ 

lim -4 i- 

#< < k < N : ^(T^- 1 ^, j/)) = e i? 1 < j < m \ 

exists and is equal to A (di, ... , d n )/A(ei, . . . , e m ) . 

Proof. This is an immediate consequence of Hopf's ergodic theorem, ap- 
plied to the two indicator functions of the cylinder sets n™ =1 T _:,+1 (/ di ) and 

nf =1 f-^\I ej ). m 

Example. An easy computation gives 

4(1,2) _ log(4/3) _ 0.2876 
4(1,3) ~ log(3/2) W 0.4055' 

Therefore, in a typical expansion, when a run of consecutive l's breaks, the 
next digit will be a "2" with probability 0.2876/(0.2876 + 0.4055) w 0.415, 
or a "3" with probability 0.4055/(0.2876 + 0.4055) w 0.585. 

4.2 Random PPTs 

PPTs have finite expansions and form a subset of Q of measure 0. So, as the 
analogous studies of continued fraction expansions of rational numbers (a.k.a. 
analysis of the Euclidean algorithm) have shown, analyzing their behavior 
can be significantly more difficult than the behavior of expansions of random 
points on Q. We outline here a technique for easily deducing some of the 
properties of the expansion by relating the discrete model to the continuous 
one. We mention some open problems which may be approachable using 
more sophisticated methods such as those used in [S], and which we hope to 
address at a later date. 

Our model for random PPTs will be the discrete probability space 

PPTjv = {(a, b, c) e PPT :c<N}, 
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equipped with the uniform probability measure P^. Analogous results can 
easily be formulated, using the same ideas presented here, for other natural 
models, e.g., a uniform choice of (a, b, c) G PPT with \a\, \b\ < N. 

We discuss the distribution of the individual digits in the expansion. Let 
A be the uniform arc-length measure on Q, normalized as a probability mea- 
sure. That is, dX = (2/ir)ds. We need the following simple lemma. 

Lemma 7. Under the measure F^, the random vector (a/c, b/c) converges 
in distribution to X. 

Proof. Observe the following fact from elementary number theory: the 
coprime pairs (m, n) such that m, n are of opposite parity have a local density 
of 4/-7T 2 in the lattice Z 2 , in the following sense: for any bounded open set 
D C I 2 , we have 

— #< (m, n) G Z 2 : ( — , — ) G D, gcd(m, n) = I, m + n = l(mod 2) > 
x 2 i \x x) J 

4 
— area(D). (12) 



o 

x— >oo 7T 



First, this is true when D is a rectangle (0, A) x (0, B). To prove this, define 
for i = 0, 1, 



&(«; fc) = #< J G Z : < j < u, j = i(mod 2), k \ j 
Let (fJL2(k))k>i be the coefficients of the Dirichlet series 

oo 
p>2 prime fc=l 



Then the left-hand side of ()T2|) is equal to 

1 oo 

— J^ /i 2 (A;) [g (Ax; k)g x {Bx\ k) + <7i(Ac; k)g (Bx; k)] . 



x* 
fe=i 



Since clearly |^(w;A;) — («/2A;)| < 1 (for k odd), this is easily seen to be 
cAB + 0((\ogx)/x) as x — > oo, where 

fe=l 
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proving our claim. 

It follows, by taking unions and differences, that (|12)) is true for D any 
finite union of rectangles with sides parallel to the coordinate axes, and 
therefore by approximation for any bounded and open D. 

For < t < 7r/2, denote 



arc(t) = < (cosm, sinw) : < u < t 

sector (t) = < (x, y) : x, y > 0, x 2 + y 2 < 1, arctan(?//x) < t }>. 
By the parametric solution © we have as iV — > oo 
¥ N I (a, b, c) E PPTiv : (a/c, b/c) E arc(t) 

# (m,n)eZ 2 : m > n > 0, gcd(m,n) = l, m + n =1(2), m 2 + n 2 < N, arctan ( Jy""^ ) < 1 1 

#J(m,n)eZ 2 : m > n > 0, gcd(m,n) = 1, m + n = 1(2), m 2 +n 2 <wl 
#j(m,n) e Z 2 : gcd(m,n) = l, m + n = 1(2), (-5L, -5=) e scctor(t/2) 1 
#j(m,n)eZ 2 : gcd(m,n) = l, m + n. = 1(2), f-^=, -7=) 6 sector (tt/4) I 

,_ ,„.. 47T -2 • area(sector(£/2))./V .„ /1NN 2£ 

= 1 + o1 t-=5 7 , r /^?v = (l + o(l)) ~- 

Air z ■ area(sector(7r/4)JiV ir 

This is exactly the claim of the Lemma. ■ 

Define the Perron-Frobenius operator of T as the operator H : Li(Q, A) — > 
£i(Q,A), 

{Hf)(x,y) = —— =-•/ 



3 + 2a:-2?/ V 3 + 2a; - % 3 + 2x ~ 2 V 
1 ( 2 + x + 2y 2 + 2x + y 



3 + 2x + 2y \3 + 2x + 2y' 3 + 2x + 2y 
1 ( 2-x + 2y 2-2x + y 



3-2x + 2y " \3-2x + 2y 1 3-2x + 2y / 

H is also known as the transfer operator corresponding to T. It has the 
property that if the random vector (X, Y) on Q has distribution f(x, y)dX, 
then T(X, Y) has distribution (Hf)(x, y)d\. We skip the simple computation 
that verifies this claim. 
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Theorem 8. UnderF^, the distribution ofd(T n ~ 1 (a/c,b/c)), then-th digit 
in the expansion of a random PPT (a, b, c) G PPT^, converges to the distri- 
bution of d(x, y) under the measure 

d\ n (x,y) = (H n ~ l (l))(x,y)d\(x,y), 

where 1 is the constant function 1 . 

Proof. This is immediate from Lemma 7 and the definition of H . ■ 

Theorem 8 answers the question of the limiting distribution as N — ► oo 
of the digits d n in the expansion of a random PPT; however, it does not 
give good asymptotic information on the behavior of these distributions as 
n — > oo. In fact, this is not a very interesting question: since the invariant 
measure is infinite, the density H n (l) will become for large n more and 
more concentrated around the singular points (1, 0) and (0, 1) (it is possible 
to make this statement more precise, but we do not pursue this slightly 
technical issue here). 

Here's one way to amend the situation in a way that enables formulating 
interesting quantitative statements concerning the distribution of the digits, 
which we mention briefly without going into detail: replace the expansion 
(dk)k=i by a new expansion (ej)* =1 , by dividing the (c4) into blocks consist- 
ing of l's and 3's and terminating with a 2; so for instance, the expansion 
(1,1,2,2,3,1,3,2,3,3,3,1,2,1) will be replaced by (112,2,3132,33312,1). 
The new expansion corresponds to the induced transformation Tb, where B 
is the middle arc in the generating partition. Cylinder sets of Tb can be easily 
computed. The invariant measure is the restriction /j,\ B , a finite measure, so 
normalize it to be a probability measure. Tg is easily shown to be exact as in 
section 3.3. A theorem analogous to Theorem 8 above can be proved, to the 
effect that the n-th digit in the "new" expansion of a random PPT converges 
in distribution to the distribution of d ncw (x,y) (the first "new" digit) under 
the measure whose density with respect to A is the (n — l)-th iterate of the 
Perron- Frobenius operator of Tb applied to the constant function 1. Since Tb 
is mixing, these densities will actually converge to the invariant density. So 
after each occurence of a "2" in the original expansion, there are well-defined 
statistics for the sequence of digits that follows up to the next "2" . 

We conclude with some open problems: study the expectation, the vari- 
ance and the limiting distribution of the length of the expansion of a random 
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element of PPTjy, as iV — > oo. Generalize to arbitrary cost-functions, as in 

m 

5 Some special expansions 

From a number-theoretic standpoint, it is interesting to study points on Q 
with special expansions. As the examples in section 1 show, simple periodic 
expansions seem to correspond to simple quadratic points on Q. It is easy to 
see that any eventually-periodic expansion corresponds to the image under D 
of a quadratic irrational. Do all quadratic irrationals have eventually periodic 
expansions? 

We also found empirically the expansions 

e l = (cos 1, sin 1) = [3, l 2 , 3, l 4 , 3, l 6 , 3, l 8 , 3, . . .], 
e^ 2 = (cos 1/2, sin 1/2) = [1, 3, l 5 , 3, l 9 , 3, l 13 , 3, . . .]. 

where l k means a succession of k l's. The first equation can be proved by 
observing that -D _1 (cos 1, sinl) = (1 — cos(l))/sin(l) = tan(l/2), and that 
the approximations 

(F 3 o F 2 o F 3 o F 4 o . . . o F 3 o Fl k ) (1) 

(iq, F 2 , F 3 as in (fTUjl ) have the continued fraction expansions 



111111 1 


1 


1 + 1 + 4+1 + 8 + 1 + 12+' 


+ 4k 



1 




1 




1 




1 




1 




1 




1 


1 


+ 


1 


+ 


4 


+ 


1 


+ 


8 


+ 


1 


+ 


12 



Thus our expansion reduces to the known ([24j. sequence A019425) infinite 
continued fraction expansion 

tan(l/2) 

The second equation is proved similarly. It is interesting to wonder whether 
other "nice" expansions exist for simple points on Q. 
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